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(54) Title: INVERSE LABELING METHOD FOR THE RAPED IDENTIFICATION OF MARKER/TARGET PROTEINS 

r<| (57) Abstract: A novel procedure for performing protein labeling for comparative proteomics termed inverse labeling is provided 
for the rapid identification of marker or target proteins. With this method, to evaluate protein expression of a disease or a drug 

^2 treated sample in comparison with a control sample, two converse collaborative labeling experiments are performed in parallel. 

■"--^ In one experiment the perturbed sample (by disease or by drug treatment) is isotopically heavy-labeled, whereas, the control is 

^ isotopically heavy-labeled in the second experiment. Whem mixed and analyzed with its tmlabeled or isotope light counterpart for 
differential comparison, a characteristic inverse labeling pattern is observed between the two parallel analyses for proteins that are 
differentially expressed to an apreciable leveL In particularly useful embodiments, protein labeling is achieved through proteolytic 
^^O-incoioration into peptides as a result of proteolysis performed in water, metabolic incoroporation of ^^N(or ^^C and ^H) into 

!^ proteins, and chemically tagging proteins with an isotope-coded tag reagent such as an isotope-coded affinity tag reagent. 
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INVERSE LABELING METHOD FOR THE RAPID IDENTIFICAIION OF 

MARKERiTARGET PROTEINS 

BACKGROUND OF THE INVENTION 

1. Field of the InYentiati 

This invention relates to metiiods for identifying specific proteins in con^lex protein 
mixtures. In particular, the methods of the present invention relate to the rapid identification 
of differentially e^^wressed proteins from two different san9>les» e.g,, different tissues, 
different cell types or different cell states, usiog mass spectrometry. 

2. Description of the Related Art 

It has been well established that most disease processes and disease treatments are 
manifest at the protein level. The mechanisms of action for most of the plmrmaceuticals on 
the naarket are indeed mediated through proteins. Comparative analysis of protein profiles 
fix>m normal and disease states, with or without dmg treatment, can facilitate the systematic 
studies of proteins involved in any biological system or disease, revealing new insights into 
disease mechanisms, identifying new targets, providing information on drug-action 
mechanisms and toxicity, and identifying surrogate markers. It is believed that proteomic 
studies will lead to important new insights into disease mechanisms and improved drug- 
discovery strategies for the discovery of novel therapeutics. 

The most common technology platform for proteomic studies to date is the integrated 
use of two-dimensional (2D) gel electrophoresis for profilmg proteins and mass spectrometry 
for protein analysis and identification as described, e.g., in Quadroni, et aL, Electrophoresis^ 
1999, 20:664-677. Ptotem mixtures derived from cells or tissues of normal or disease states 
are separated on 2D PAGE and visualized via staining. Quantitative comparisons of images 
can be made after the images of the displayed proteins are digitally scanned into a computer. 
The spots that are either unique or those that are differentially expressed are then identified. 
Following excision of the spots and in situ digestion, a variety of nmss spectrometric 
techniques can be used to obtain peptide fingerprint and p^tide sequence information which 
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are used to search a sequence database to identify the proteins. As these proteins are disease 
specific, each could potentially become a new target for drag discovery or be used as a 
disease marker. At the present time, 2D-PAGE is still the most comprehensive method for 
displaying proteins. 2D gels have been shown to be highly reproducible since the 
introduction of immobilized pH gradient (IPG) strips for the first dimensional separation. It 
is capable of resolving thousands of proteins and, when stained with silver or fluorescent 
dyes, it provides a sensitive method for quantitating protein expression. Nonetheless, there 
arc still certain shortcomings with the technique. Chief among them is its inability to display 
all protein components, such as membrane proteins, proteins wfth extreme pis, and proteins 
of low copy numbers. Inadequate resolving power is another pitfall with the technique. Up 
to 20-40% of all spots may contain more than one protein, which makes quantitative 
comparison of protein e:q>ressions and interpretation of experiments extremely difHcult. 
Although a lot of progress has been made over the last few years, proteomics using 2D gels is 
still viewed as a difficult technology in terms of automation and throughput. 2D gel 
elTOtrophoresis, staining, and image analysis are just some of the steps that remain to be fully 
automated before the process can be truly called high throughput. Alternatives to this 
technology, particularly to replace the use of 2D gels, are being explored in the hope of 
achieving better throughput and higher sensitivity. 

One approach that omits 2D gels is the use of multi-dimensional liquid phase 
separation techniques such as chromatography and/or solution isoelectric focusing to partially 
resolve mixtures of proteins or ttieir digested peptide products as described, e.g., in Eng et al., 
J. Anu Soc. Mass Spectrom. 1994, 5:976-989; McCormack et ai. Anal Chem. 1997, 69:767- 
776; Opiteck et al., AnaL Chem. 1997, 69:2283-2291; Opiteck et al.,i4jwi Chem. 1997, 
69: 1518-1524; Opiteck et al.. Anal Biochem. 1998. 258:349-361; Kojima et aL, 
7. Chromatogr. 1982, 239:565-570; Isobe et al., /. Chromatogr. 1991, 588:115-123; WaD et 
al.,i47ial CAem. 2000,72:1099-1111; Jensen etal.,An^7i Chem. 1999, 71:2076-2084; and 
Paga-Tolic et al., J. Anu Chem. Soc. 1999, 121:7949-7950. Mass spectrometry CMS) with 
additional resolving power, is used to identify the simplified mixture. Since separation 
occurs in the liquid phase, tiie automation potential is much higher than the gel-based 
platform. When running at preparative scale, sample loading is significantly larger than what 
is achievable with 2D PAGE. In addition, this approach reduces the protein / peptide 
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recovery losses associated with 2D-gel technology since the final separated proteins / 
peptides are in solution. One negative aspect is that the quantitative information gained from 
2D-gel imaging is not yet achievable with these methodologies. 

Isotope dilution has long been used for quantitative analysis of drug in biological 
materials. An internal standard, which is isotopically different in structure, is added to the 
samples to achieve accurate quantitation of a particular compound. Because of the internal 
standard, variables such as sample loss during sample preparation, matrix effects, detection 
interferences, and others, are no longer issues for accurate quantitation. In order to apply the 
same principle to relative protein quantitation, efforts have been made towards the 
development of protein tagging or isotope labeling methodologies. Labeling of a pool of 
proteins can be carried out metabolically or chemically. When evaluating differential 
expression of proteins, two pools of proteins (e.g., a normal vs. a disease state), one labeled 
(with heavy isotope) and the other not (i.e., with natural, light Isotope), are mixed, 
proteolyzed and analyzed. Each pair of peptide signals, with and without label, becomes the 
internal standard for each other and enables the quantitative comparison of protein 
differential expression. While the peptide fingerprint and peptide sequence information 
obtained from MS analysis provides the identification of proteins, the label offers a means to 
differentiate the two populations and perform accurate quantitation on every protein. Protein 
profiling, quantification, and identification are therefore performed in a single step. Oda et 
aL, Proc. Nail Acad. Scl USA 1999, 96:6591-6596, have demonstrated such an ^proach 
where proteins are metabolically labeled during cell culture in a ^^-enriched culture media. 
Similar strategies may also be applied via amino acid specific labeling of proteins achieved 
metabolically during cell culture cultivation as described, e.g*, in Chen et al.. Anal Chem. 
2000, 72:1134-1143. Gygi et aL, Nature Biotech, 1999, 17:994-999, have developed a 
chemical derivatization schecae, termed isotope-coded affinity tagging (ICAT) to carry out 
labeUng on all cysteine-containing proteins. With the approach, relative protem quantitation 
is achieved through the use of two isotopically different, light and heavy tags. The method 
has been applied successfully in a number of cellular systems to obtain quantitative 
comparison of protein expression. The built-in affmity tag in the label enables the reduction 
of peptide mixture complexity by selectively enriching only the cysteine-containing peptides. 
It however also risks losing information on non-cysteine-containing proteins and information 
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regarding protein post-translational modifications. Data analysis can be tedious with these 
methods. There is no built-in mechanism to perform subtractive analysis to achieve a quick 
focus on proteins that change the most in expression. Rather, each peptide pair of light and 
heavy tags has to be identified and relative quantitation performed for all proteins before a 
rank order can be obtained. Dynamic range is another limiting factor with the methods. 
Signals from peptides with both light and heavy isotope tags have to be quantitatively 
detected in order to obtain accurate quantitation of protein expression. In an extreme 
situation where only one signal of the pair is detected, the signal can be confused as a 
chemical background or from a nonnsysteine-containing peptide rather than from a protein 
that has been highly differentially expressed. In addition, the labeling methods n^ntioned 
here all require special reagents (custom-made chemicals or isotopically enriched culture 
media) and extra effort to introduce the labels, which may or may not be readily accessible to 
a protein analytical lab or an MS lab. 

While the above methods permit the identification and quantitation of differentially 
expressed proteins in complex protein mixtures, these methods are delicient in either 
speed/throughput, sensitivity, the ability to cover all proteins or the ability to identify extreme 
changes in expression or protein covalent changes. Accordingly, it would be desirable to 
provide a method for identifying various classes of differentially expressed proteins in 
. complex protein mixtures that is rapid, high throughput, sensitive and capable to identify all 
changes in protein expression (quantitative or qualitative) unambiguously- 

SUMMARY OF THE INVENTION 

The present invention relates to a novel procedure of performmg protein labeling for 
comparative proteomics termed inverse labeling which is utilized to identify differentially 
expressed proteins within complex protein mixtures* Tn particular, the noethod of the present 
invention allows the identification of differentially expressed proteins in two different 
samples, for example, different tissue or cell types, disease or developmental stages. 

The method as described herein below, overcomes disadvantages inherent in currently 
available methods in that it provide rapid, high flnroughput, sensitive, reliable and 
unambiguous identification of various classes of differentially e^essed proteins. 

-4- 
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In one aspect, a method for identilfying a differentially expressed protein in two 
different sairples containing a population of proteins is provided which comprises 
a) providing two equal protein pools from each of a reference sample and an experimental 
sample; b) labeling the protein pools with a substantially chemically identical isotopically 
different labeling i^agent for proteins, wherein one pool from each of the reference and 
experimental pools is labeled with an isotopically heavy protein labeling reagent to provide 
an isotopically heavy-labeled reference pool and an isotopically heavy-labeled experimental 
pool, and wherein the remaining reference and experimental pools are labeled with an 
isotopically light protein labeling reagent to provide an isotopically light-labeled reference 
pool and an isotopically light labeled experimental pool; c) combining the isotopically light- 
labeled reference pool with the isotopically heavy-labeled experimental pool to provide a first 
mixture; d) combining the isotopically heavy-labeled reference pool with the isotopically 
light-labeled experimental pool to provide a second mixture; e) detecting the labeled proteins 
from each of the two mixtures; and f) conq)aring the labeling pattern obtained for the labeled 
proteins in the first and second mixtures, wherein an invei^e labeling pattern of a protein in 
the second mixture compared with the labeling pattern of the protein in the first mixture is 
indicative of the differentially expressed protein in the two different samples. 

In another aspect, a method for identifying a differentially expressed protein in two 
different samples containing a population of proteins is provided which comprises 

a) providing two equal protein pools from each of a reference sample and an experimental 
sample; b) proteolyzing each protein pool during labeling of each of the protein pools with 
isotopically-Iabeled water, wherein one pool from each of the reference and experimental 
pools is labeled with *^0-water to provide an ^^Olabeled reference pool and an ^^O-labeled 
experimental pool, and wherein the remaining reference and experimental pools are labeled 
with ^^O-water to provide an *^0-iabeled reference pool and an ^^O-labeled experimental 
pool; c) combining the^^O-labeled reference pool with the ^^O-labeled experimental pool to 
provide a first mixture containing ^^O- and *^0-labeled peptides; d) combining the 
^^O-labeled reference i)ool with the ^^O-labeled experimental pool to provide a second 
nnxture containmg ^^O- and ^^O-labeled peptides; e) detecting the labeled peptides from each 
of the two mixtures; and f) comparing the labeling pattern obtamed for the labeled peptides in 
the first and second nndxtures, wherein an inverse labeling pattern obtained for a peptide in the 
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second mixture compared with the labeling pattern obtained for the peptide in the first 
mixture is indicative of the differentially expressed protein from which the peptide 
originated. 

In another aspect, a method for identifyhig a differentially expressed protein in two 
different samples containing a population of proteins is provided which comprises 
a) providing two equal protein pools from each of a reference sample and an experimental 
sample; b) proteolyzing the proteins in each of the protein pools to provide peptide pools; 

c) labeling each peptide pool with isotopically-Iabelcd water, wherein one peptide pool from 
each of the reference and experimental pools is labeled with ^^O-water to provide an 
^^O-labeled reference peptide pool and an ^^O-labeled experimental peptide pool, and 
wherein the remaining reference and experimental peptide pools are labeled with ^^O- water to 
provide an ^^O-labeled reference peptide pool and an ^^Olabeled experimental peptide pool; 

d) combining the ^^O-labeled reference pool with the *^0-labeled experimental pool to 
provide a first mixture containing ^^O- and ^^O-labeled peptides; e) combining the 
^^O-labeled reference pool with the ^^O-labeled experimental pool to provide a second 
mixture containing ^^O- and ^^O-labeled peptides; f) detecting the labeled peptides from each 
of the two mixtures; and g) comparing the labeling pattOTi for the labeled peptides in the first 
and second mbrture, wherein an inverse labeling pattern obtained for a peptide in the second 
mixture compared with the labeling pattern obtained for the peptide in the first mixture is 
indicative of the differentially expressed protein from which the peptide originated. 

In yet another aspect, a method for identifying a differentially expressed protein in two 
different samples containing a population of proteins is provided which comprises 
a) providing two equal protein pools from each of a reference sample and an experimental 
sample wherein one pool from each of the reference and experimental pools is produced by 
cultivation in a culture medium containing an isotopically heavy-labeled assimilable source 
to provide an isotopically heavy-labeled reference pool and an isotopically heavy-labeled 
experimental pool, and wherein the remaining reference and experimental pools are produced 
by cultivation in a culture medium containing an isotopically light-bbeled asshnilable source 
to provide an isotopically light-labeled reference pool and an isotopically light-labeled 
experimental pool; b) combining the isotopically light-labeled reference pool with the 
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isotopically heavy-labeled experimental pool to provide a first protein mixture; c) combining 
the isotopically heavy-labeled reference pool with the isotopically light-labeled experimental 
pool to provide a second protein mixture; d) detecting the labeled pro^ins from each of the 
two mixtures; and e) comparing the labeling pattern obtained for the labeled proteins in the 
first and second mixture, wherein an inverse labeling pattern of a protein in the second 
mixture compared with the labeling pattern of the protein in the first mixture is indicative of 
the differentially expressed protein in the two different samples. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 . The inverse labeling method for rapid identification of marker/target proteins. For 
illustration purposes, proteins that remain unchanged in the two protein pools are shown in 
etpal abundance. (In practice, they may not necessarily be present in equal abundance; 
rather, they may be present at a constant ratio that is not equal to one.) Protein proteolytic 
*^0-labeling is used in this schematic diagram for illustration. 

Figure 2. Liquid Chromatography/Mass Spectrometry (LC/MS) detection of an inverse 
^^OJabeled BS A tryptic peptide. (A): MS of the ^^O-control - ^^0-"treated** sample; 
CB): MS of the ^*0-control - ^^0-**treated" sample; (C): MS/MS of the peptide in (A); and 
(D): MS/MS of the peptide in (B). A 2-Da mass shift between (A) and (B) on the most 
abundant isotopic ions indicates a significant differential expression of the protein. The mass 
shift is further verified/confirmed in the MS/MS spectra (C) and (D) by the 2-Da shift of all 
Y ions, which also helps to identify Y ions and B ions and thus helps in the interpretation of 
the spectra. The BS A protein is exclusively identified trom database searching using the Y 
ions (those with a 2-Da shift). 

Figures. LC/MS detection of an inverse "O-labeled aldolase tryptic peptide. (A): MS of 
the ^^O-control- *^0-»*treated" sainple; (B): MS of the ^^O-control - '^O-^reated" sample; 
(Q: MS/MS ofthe peptide in (A); and (D): MS/MS of the peptide in (B). A 4-Da mass shift 
between (A) and (B) on the most abundant isotopic ions indicates a significant differential 
expression ofthe protein. The mass shift is further verified/confirmed in the MS/MS spectra 
(Q and (D) by the 4-Da shift of all Y ions, which also helps to identify Y ions and B ions and 
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thus helps in the interpretation of the spectra. Aldolase protein is exclusively identified from 
database searching using the Y ions (those with a 4-Da shift). 

Figure 4. MALDI TOF detection of inverse ^^O-Iabeled tryptic digests of the 8«protein 
mixtures. (A): ^^O-control- ^^O-'treated" sample; (B): ^^O-control - ^^0-**treated" sample; 
(C): monoisotopic patterns of a BSA peptide MH* I567»9 in (A) (upper) and (B) (lower); 
and (D): monoisotopic patterns of an aldolase peptide MH*** 21073 in (A) (upper) and (B) 
(lower). The mass shifts or ^^O/^^Ointensity ratio reversal indicates differential expression 
of the proteins: "down-regulation** of BSA and **up-regulation" of aldolase. 

Figure 5, MALDI PSD spectra of an inverse ^^O-iabeled aldolase tryptic peptide MH*^ 
2107.3. (A): in the ^^O-control- *^0**treated" sample; and (B): in the *^0-control - 
^^O-'^treated'* sample. The 4-Da mass shift observed on the nK>lecuIar ion in Figure 4 (D) is 
further verified/confirmed in the PSD spectra by the 4-Da shift of alt Y ions. This also helps 
to identify Y ions and B ions and thus helps in the interpretation of the PSD spectra. The 
aldolase protein is exclusively identified from database searching using the Y ions (those 
with a 4-Da shift). 

Figure 6. LC/MS detection of a PTP (protein tyrosine phosphatase) tryptic peptide from a 
CHO cell lysate spiked with PTP- IB. (A): MS of the ^^O-PTPIO- *^OPTP30 sample; 
(B): MSofthe^^O-PTPlG-^^O-PTP30san^le;(C): MS/MS of the peptide in (A) in-set; 
and (D): MS/MS of the peptide in (B) in-set, where PTPIO is a 0.25 mg CHO cell lysate 
spiked with 10 pmol of PTP- IB; PTP30 is a 0,25 mg CHO cell lysate spiked with 30 pmol of 
PTP-IB. After spiking, the protein mixtures are proteolyzed, and subsequently inverse *^0- 
labeled to form the two mixtures A and B. A 4-Da mass shift between (A) and (B) (inserts) 
on the most abundant isotopic ions indicates a significant "differential expression** of the 
protein. The mass shift is fiirther verified/confirmed m the MS/MS spectra by the 4-Da shift 
of all Y ions, which also helps to identify Y ions and B ions and thus helps in the 
interpretation of the MS/MS spectra. PTP-IB protein is exclusively identified from database 
searching using the Y ions (those with a 4-Da shift)* 
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Figure 7. MALDI TOF detection of tryptic digests of an in verae ^^-labeled two-protein 
system with FTP protein 3-fold up-regulated in the "treated". (A): ^^-control- 
^^N-*Hreated" sample; (B): ^^N-control - ^'^N-^treated" sample. The lower panels are the 
selective zoon^-in m/z regions. 

Figure 8. MALDI TOF detection of tryptic digests of an inverse ^^N-labeled two-protein 
system with FTP protein 100-fold down-regulated in the 'treated'*, (A): ^"^-control - 
^^N-'treated" sample; (B): ^^N-control- ^^N-"treated" sample. The lower panels are the 
selective zoomed-in m/z regions. 

Figure 9. LOMS-MS/NK detection of tryptic digests of an inverse ^^N-labeled two-protein 
system with FTP protein 3-fbld down-regulated in the "treated*'. (A): MS of the 
^"^N-control - ^^N-'treated'* sample; (B): MS of the ^^N-control - ^'^-^^treated" sample; 
(a) base-peak ion chromatograms of the two LC/MS-MS-MS runs; (b) MS spectra of a 
peptide in (a) displaying the inverse labeling pattern (mass shift); and (c) MS/MS spectra of 
the peptide in (b) (on the doubly charged ion). The FTP protein is exclusively identified jfrom 
database searching using the MS/MS data of the ^"^N-peptide (upi>er (c)). 

Figure 10. LC/MS-MS/MS detection of tryptic digests of an inverae^^NT-labeled algal cell 
lysate spiked with FTP protein, with FTP 3-fold down-regulated in the "treated". (A): MS of 
the ^'^N-control - ^^N-'*treated" san^le, averaged spectrum over a 3-min IX/MS window; 
(B): MS of tho ^^N-contiol - ^"^N-'treated" sample, averaged spectrum over a 3-min window; 
(Q: MS/MS of the peptide in (A) m/z 623.5; and (D): MS/MS of the peptide m (B) m/z 
631.3; where ^%-controI is a 0.05 mg ^^C-algal protem spiked with lOpmol of FTP- IB; 
^^N-control is a 0.05 mg "C-**N-algal protein spiked with 10 pmol of *^-FTP; ^^.'^treated" 
is a 0.05 mg '^C-^lgal protein spiked with 0.3 pmol of PTP-IB; and ^^N-**treated" is a 
0.05 mg ^^C-^^N-algal protein spiked with 0.3 pmol of ^^N-PTP. Mass shifts or inverse 
labeling pattern between (A) and (B) were observed on the marked ions (*). The inverse 
labeling or differential expression is further verified/confirmed in the MS/MS spectra by their 
similar fragmentation pattern. FTP- IB protein is exclusively identiJBed from database 
searching using MS/MS data of the ^"^-peptide (C). 

Figure 1 1 . MALDI TOF detection of tryptic digests of an invei^ ICAT-labeled six-protein 
system. (A): Do-control-Ds- treated** sample; (B): Dg-control - Dq- treated" sample. The 
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lower panels are the selective zoomed-in m/z regions. The mass shifts or Do-ZOg-intensity 
ratio reversal indicates differential expression of proteins. 

Figure 12, LC/MS detection of tryptic digests of an inverse ICAT-labelcd six-protein system. 
(A): Base-peak ion chromatogram of the Do-control - Ds-"treated'* sample; (B): Base-peak 
ion chromatogram of the Dg-control - Do-"treated*' sample. Signals of the characteristic 
inverse labeling pattern of mass shifts are clearly detected. The differentially expressed 
proteins are quickly identified using their MS data, 

DESCRIPTION OF TOE INVENTION 

All patent applications, patents and literature references cited herein are hereby 
incorporated by reference in their entirety. 

The term "differentially expressed" with respect to protein(s) refers to quantitative 
changes in expression level as well as qualitative changes such as covalent changes, e.g., 
post-translational modifications such as protein phosphorylation, protein glycosylation, 
protein acctylation and protein processing of the C- or N-terminal of a protein. 

The term "sample" as used herein, is used in its broadest sense. Suitable samples mclude, 
but are not limited to, cell homogenates; cell firactions; tissue homogenates; biological fluids 
such as blood, urine, and cerebrospinal fluid; tears; feces; saliva; and lavage fluids such as 
lung or peritoneal lavages. 

The term "stable isotope" refers to a non-radioactive isotopic form of an element. 

The term "radioactive isotope" refers to an isotopic form of an element that exhibits 
radioactivity, Le., the property of some nuclei of spontaneously emitting gajomia rays or 
subatomic particles (e.g., alpha and beta rays). 

The term "isotopically light protein labeling reagent** refers to a protein labeling reagent 
incorporating a light form of an element, e.g., H, ^^C, ^"^N, ^^O or ^^S. 

The term "isotopically heavy protein labeling reagent" refers to a protem labeling reagent 
incorporating a heavy form of an element, e.g., ^H, ^^C, ^^N, ^'^O, ^^O or ^S. IsotopicaUy 
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light and isotopically heavy protein labeling reagents are also referred herein as unlabeled 
and labeled reagents, respectively. 

The term "inverse labeling pattern" means a qualitative mass shift or an isotope peak 
intensity ratio reversal, i.e., from the heavy-labeled signal being stronger to the light-labeled 
signal being stronger (or vice versa), detected between the two inverse labeled mixtures. 

The present invention relates to a novel procedure of performing protein labeling for 
comparative proteomics known as inverse labeling, which allows for the rapid identification 
of marker or target proteins, those in which expression levels have significantly changed 
upon a perturbation or those in which covalent changes have occurred upon a perturbation, 
e.g,, as a result of either a disease state or drug treatment, contact with a potentially toxic 
material, or change in environment (e.g,, nutrient level, ten5)erature, passage of time). The 
rapid identification of differentially expressed proteins can be applied toward the revealment 
of new disease mechanisms, the elucidation of drug-action mechanisms and the study of drug 
toxicity. The method involves performing two converse collaborative labeling experiments 
in parallel on two different sarr^les each containing a population of proteins. The two 
different samples are designated as the reference and experimental samples. These samples 
can differ in cell type, tissue type, organelle type, physiological state, disease state, 
developmental stage, environmental or nutritional conditions, chemical or physical stimuli or 
periods of time. For example, the reference and experimental samples can represent normal 
cells and cancerous cells, respectively; treatment without and with a drug, respectively, and 
the like. 

The niethod comprises providing two equal protein pools from each of the reference 
and experimental samples. Bach protein pool is then labeled with a protein labeling reagent, 
which is substantially chemically identical, except that it is distinguished in mass by 
incorporating either a heavy or light isotope. The isotope can be a stable isotope or a 
radioactive isotope. Incorporation of a stable isotope into the protein labeling reagent is 
preferred because it is stable over time thereby mitiimizing variations due to handling and 
thus provides more accurate quantitative measurements and is more environmentally safe 
than a radioactive isotope. 
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With respect to labeling of the protein pools, one protein pool from each of the 
reference and experimental samples is labeled with an isotopically heavy protein labeling 
reagent to provide an isotopically heavy-labeled reference pool and an isotopically heavy- 
labeled experimental pooL The remaining pool from each of the reference and e;cperimental 
samples is labeled with an isotopically light protein labeling reagent to provide an 
isotopically light-labeled reference pool and an isotopically light-labeled experimental pool. 

The protein labeling reagent can be any suitable reagent utilized to label proteins* 
The isotope is included in the reagent and thus is incorporated into the proteins. The labeling 
may be achieved chemically, metabolically, proteolytically or other suitable means to 
incorporate isotope into the proteins* 

In one embodiment, the protein labeling reagent can be a reagent that contains a group 
that reacts with a particular functional group of a protein, ie., chemical labeling of the 
protein. Exan^les of reactive groups of protein labeling reagents include those that react 
with sulfhydryl groups, amino groups, carboxylic acid groups, ester groups, phosphate 
groups, aldehyde and ketone groups and the like. Examples of thiol reactive groups include, 
but are not limited to, nitriles, sulfonated alkyl or aryl thiols, maleimide, epoxides and alpha- 
haloacyl groups. Examples of amino reactive groups include, but are not limited to, 
isocyanates, isothiocyanates, active esters, e.g., tetrafluorophenylesters and 
N-hydroxylsuccinimidyl esters, sulfonyl halides, acid anhydrides and acid halides. Examples 
of carboxylic acid reactive groups include, but are not limited to, amines or alcohols in the 
presence of a coupling agent such as dicyclohexylcarbodiimide, or 2,3,5,6-tetrafluorophenyl 
trifhioracetate. Exan^les of ester reactive groups include, but are not limited to, amines 
which react with homoserine or lactone. Examples of phosphate reactive groups include, but 
are not limited to, chelated metal where the metal, e.g., Fe(III) or Ga(IH> is chelated to 
nitrilotriacetic acid or iminodiacetic acid. Aldehyde or ketone reactive groups include, but 
are not linoited to, amines and NaBH4 or NaCNBH*, such as described in Chemical Reagents 
for Protein Modification by R. Lundbald (CRC Press 1991). 

One particularly useful type of protein labeling reagent is the affinity tag-containing 
recent. Use of an affinity tag-containing reagent is particularly advantageous, in that 
specific classes of proteins, e.g., those contaming phosphate groups, can be subjected to 
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affinity purification, which can eliminate undesirable proteins thereby reducing the 
complexity of the protein pools and further enriching for particular classes of proteins. In 
addition, such affinity tag-containing reagents can also eliminate undesirable contaminants ^ 
that are inconiqpatible or that would mask identification of specific proteins with mass 
spectrometry. For example, the above protein pools can be biotinylated with an isotopically 
heavy and isotopically light biotin-containing protein labeling reagent. Biotinylated-Iabeled 
protenxs present in the protein pools can then be purified by biotin-avidin chromatography. 
The same principle can apply to peptides after proteolysis of the labeled protein mixtures to 
enrich particular classes of peptides or to reduce the mixture complexity, and thus potential 
interference on the identification of ^>ecific proteins with mass spectrometry. 

The affinity tag for selective isolation of a protein or peptide modified with a protein 
labeling agent can be introduced at the same time as isotope ijotcorporation, or, in a separate 
reaction prior to or post protein isotope labeling. In the case of a specific afiOnity tag reagent 
known as isotope-^oded affinity tag (ICAT) reagent as described by Gygi et al, supra^ the 
biotin affinity tag is part of the protein labeling reagent and is thus introduced at the sams 
time as isotope labeHng. Johnson et al. and Shaler ^ al., 2001, The 49^ ASMS Confearence 
on Mass Spectrometry and Allied Topics, Caiicago, Illinois; both describe affinity tags which 
are introduced prior to isotope labeling through amino acid-specific chemistry. After affinity 
enrichment of the tag-containing proteins^eptides, isotope labels can be introduced through a 
general modiflcatk>n scheme, such as N-terminal acylation, C-terminal esteriflication, or 
cysteine chemistry if a cleavable tag is employed as described, e.g., in Johnson et al, supra. 
Affinity tagging can also occur post isotope teheling. IiKslnded in such examples is the use of 
cysteine-specific biotinylation reagent to react and pool out cysteine-containing 
proteins/peptides after a general labeling procedure is performed such as N-terminal 
acylation, C-terminal esterification, or other non-chemical labeling methods such as 
metabolic *^-labeling as described, e.g., in Conrads et al., Anal. Chem. 2001, 73:2132-2139. 

An example of a specific affinity tag-containing protein labeling reagent that has been 
used to label proteins derived firom different samples for study of protein differential 
expression is the ICAT reagent as described, e.g., in Gygi et al, supra\ and WO 00/1 1208. 
The structure of an ICAT reagent consists of tibree functional elements: 1) a biotin affinity 
tag, 2) a linker incorporatfag either H.or and 3) a protem reactive group, e,g„ a sulfhydryl 
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reactive group. In the ICAT method, the side chains of amino acid residues, e.g., cysteinyl 
residues, in a reduced protein sample are modified with the isotopically light form of the 
ICAT reagent. The same groups in a second protein sample are modified with the 
isotopically heavy form of the ICAT reagent. The two-labeled protein samples are combined 
and then proteolyzed to provide peptide fragments, some of which are labeled. The labeled 
(cysteine-containing) peptides are isolated by avidin affinity chromatography and then 
separated and analyzed by LXJ-MS/MS, An example of an ICAT reagent is biotinyl- 
iodoacetylamidyl-4,7,10 trioxatridecanediamine which consists of a biotin group for affinity 
purification, a chemically inert spacer which can be isotopically-labeled with stable isotopes 
for mass spectral analysis and an iodoacetamidyl group for reaction with sulfhydryl groups 
on proteins as described, e.g., in WO 00/1 1208. Similar strategies can be applied to the use 
of other reagents that contain different reactive groups for proteins. 

In another embodiment, the protein labeling reagent can be a reagent that is able to be 
incorporated into the protein, e.g*, by metabolic labeling of the protein pools^ For example, 
the protein pools from the reference and experimental samples can represent different types 
of cells that are cultured in a culture medium containing an isotopically heavy or light- 
labeled assimilable source including, but not limited to, ammonium salts (e.g., ammonium 
chloride), glucose, or water, or one or more isotopically heavy- or light-labeled amino acids, 
e,g,, cysteine, methionine, lysine, etc., to provide Id^eled proteins incorporating the heavy or 
light isotope, such as and ^"^N, *^C and ^^C, and H, or and ^^S, respectively* 

In a particularly useful embodiment, proteins are l^eled as a direct result of 
proteolysis that is performed with the protein labeling reagent, ^*0- and ^^O-labeled water, as 
described e.g., in Rose et al„ Biochem. J, 1983, 215:273-277; and Rose et al., Biochem. J. 
1988, 250:253-259 and as set forth in more detail below. 

Once labeling of the pools is compMed^ the isotopically light-labeled reference pool 
is combined with the isotopically heavy-labeled experimental pool to provide a first mixture* 
The isotopically heavy-labeled reference pool is then combined with the isotopically light- 
labeled experinsental pool to provide a second mbcture. Accordhigly, in the first mixture, the 
isotopically heavy-labeled proteins are derived from the experimental pool, whereas in flie 
second mixture the isotopically heavy-labeled proteins are derived from the reference pooL 
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Through isotopic labeling, the identical protein in the reference and e^rimental samples is 
distinguished by mass to allow their independent detection and quantitative coniparison 
between two samples by suitable techniques, e.g., mass spectrometric techniques. 

The proteins in the first and second mixtures are preferably enzymatically or 
chemically cleaved into peptides by utilizing proteases, e,g., trypsin; chemicals, e-g,, 
cyanogen bromide; or dilute acids, e.g., hydrogen chloride. Preferably, the labeled proteins 
are digested with trypsin. Typical trypsin:protein ratios (wt:wt) that are added to each protein 
solution range from about 1:200 to about 1:20. Digestion is allowed to proceed at about 37°C 
for about 2 to about 30 hours. Digestion of the proteins into peptides can also be carried out 
prior to or during labeling of each of the protein pools of the reference and experimental 
samples as is described in more detail below. The digestion step can be eliminated when 
analyzing small proteins. 

The digested labeled peptides or labeled proteins from the first and second mixtures 
are then detected by any suitable technique capable of detecting the difference in mass 
]yei^CQXi the isotopically labeled peptide or labeled protein derived from the reference and 
experimental samples. Preferably, the digested labeled peptides or labeled proteins are 
separated and subsequently analyzed by well known fractionation techniques as described 
below coupled with MS techniques which are well known in the art. While a number of MS 
and tandem MS (MS/MS) techniques are available and may be used to detect the peptides. 
Matrix Assisted Laser Desorption Ionization MS (MAUDIZMS) and Electrospray ionization 
MS are preferred. The quantitative comparison of the separated labeled peptides or separated 
labeled proteins are reflected by the relative signal intensities for peptide or protein ions 
having the identical sequence that are labeled with the isotopically heavy and light labeled 
protein reagent. The chemically identical peptide or protein pairs are easily visualized during 
a mass spectrometric scan because they coelute or closely elute by chromatography and they 
differ m mass. If expression of a protein has been up or down regulated, ie*, a true shift in 
signal intensities of the light isotope and heavy isotope is observed in the first mixture, the 
inverse should be observed in analyzing the second mixture due to inverse labeling. If 
eTqnression of a protein remains unchanged following a perturbation, th^e will be no 
significant difference in the labeling pattern between the first and second mixtures. 
Accordingly with inverse labeling, instead of quantitatively calculating the ratio of the 
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isotopically light to isotopically heavy signals for every peptide as is carried out in prior art 
isotopic labeling methods for identifying the differentially expressed proteins, two data sets 
are readily compared to quickly identify peptides of such qualitative changes that are 
indicative of differentially expressed proteins. 

Selective mass spectrometric detection may also be used to selectively detect a 
particular group of peptides after a general labeling scheme, such as by precursor ion 
scanning for the detection of phosphopeptides or glycopeptides as described, e,g., in Wilm, et 
al.. Anal. Chem. 1996, 68: 527. 

The sequence of one or more labeled small proteins or labeled peptides is detemcdned 
by standard techniques, e.g., tandem mass spectrometry (MS/MS) or post source decay 
(PSD). At least one of the peptide sequences derived from a differentially expressed protein 
will be indicative of that protein and its presence in the reference and experimental samples. 
In addition, peptide fingerprint data can be generated by MS. Subsequently, data generated 
by MS of peptide fingerprints or peptide sequence information can be used to search a protein 
database for protein identification. 

In a particularly preferred embodiment of the present method as exemplified below, 
protein pools of the reference and experimental samples are proteolyzed \ising trypsin prior to 
or at the same time of labeling with ^^O- and ^^Owaten One ^®0-atom and one ^^O-atom is 
incorporated Into the newly formed carboxy terminus as a consequence of hydrolysis during 
proteolysis. An additional ^^O and ^^O may be incorporated into the terminal carboxy group 
through a mechanism of protease-catalyrcd exchange as described, e.g„ in Rose et al.,1988, 
siq?ra. Thus, following digestion by trypsin all of the resulting peptides except for C-terminal 
peptides that lack Lys or Arg at the C-terminus are labeled with either one or two ^^O- and 
*^0-atoms at the C-terminus (mostly two if enough time is allowed for exchange). Mainly for 
the purpose of conserving the expensive ^^O-water, both during-proteolysis and post- 
proteolysis incorporation of ^^O-labels have been explored. According to previous studies, 
^*0-Jabels msy be incorporated into peptides at the C-terminal carboxy group through 
protease-catalyzed exchange, (See, e.g.. Rose et al., 1988, supra; and Schnolzer et al. 
Electrophoresis 1996, 17:945-953.) This is confirmed by the observation that the naajority of 
the non-C-tenninal peptides are found to have incorporated more than one ^*0-atom when a 
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protein is digested in ^^O-water. By adding a very small volume of O-water (-10 \xl) to a 
completely dried peptide mixture post-proteolysis (with or vs?ithout additional trypsin) and 
allowing the exchange to occur at room temperature for 5-12 hours, the same level of 
'^O-incorporation is achieved as that of during-protcolysis labeling. 

The post-proteolysis labeling can be very advantageous when dealing with proteins or 
protein mixtures for which reduction in volume is problematic. By doing post-proteolysis 
labeling, digestion can be carried out in the normal way in a regular water buffer, on cell 
lysate, or on membrane proteins, without worrying about protein precipitation dtiring 
concentration or the use of a large quantity of the expensive ^^O-water to reach an 
overwhelming ^*0-environment for labeling. Once proteins are proteolyzed to peptides, 
concentration and precipitation is normally less of a problem, and the labeling process via 
protease-catalyzed exchange can be carried out using a very small amount of ^*0-water. 
AiK>ther area where post-proteolysis labeling may prove to be very useful is in the 
performance of ^*0-labeling experiments on gel-separated proteins via in-gel digestion. By 
carrying out ^^O-labeliag post-proteolysis, the amount of ^^O-water required is substantially 
reduced, since the labeling is perforaied on the dried, extracted peptides. In contrast, the 
labeling will be performed on gels for during-proteolysis labeling where enough '^O-water 
has to be used to cover all swollen gel pieces. 

Theoretical calculations on peptides up to 3,000 Da in si2» indicate that a change of 
2.5-fold or higher in protein expression is likely required to achieve a clear and reliable 
observation of the characteristic ^^OV^^O-intensity reversal or mass shift on the peptides 
between the two experiments. Accordingly, in the two model systems exemplified below, a 
value of three-fold is chosen for use. In both cases, peptides of the characteristic inverse 
labeling pattern are clearly detected, and with the data, the expected proteins are exclusively 
identified from the databases. In reality, protein differential expression, typically with a two- 
fold or greater difference in expression levels, is considered to be statistical significant. In a 
typical proteondc analysis involving disease and norn^ mammalian material, 50-300 such 
unique or differentially expressed proteins may be identified as described, e.g*, in Page et al.. 
Drug Discovery Today 1999, 4:55-62. A cut-off value such as five-foM or greater in protehi 
changes may be applied to focus on the most important proteins. 



-17- 



wo 02/052271 



PCT/EPOl/15228 



Additional fractionation schemes at the protein or peptide level may be required in 
order to reduce the complexity of the proteins in the reference and experimental samples, and 
complexity of protein mixtures or peptide mixtures that reach the mass spectrometer to 
reduce the chances of interference of separated peptides or small proteins and thus clear 
detection of the inverse labeling pattern and the identification of the proteins- Conventional 
fractionation techniques for reducing the complexity of protein mixtures include, but are not 
limited to, ammonium sulfate precipitation, isoelectric focusing, size exchision 
chromatography, ion exchange chronaatography, adsorption chromatography, reverse phase 
chromatography, affinity chromatography, ultrafiltration, immunopr ecipitation and 
combinations thereof Conventional fractionation techniques for reducmg the comtplexity of 
peptide mixtures include, but are not limited to, size exclusion chromatography, ion exchange 
chromatography, adsorption chromatography, reverse phase chromatography, afHnity 
chromatography, immunoprecipitation and combinations thereof. For exart5>le, generic 
afSnity procedures can be applied after a general labeling scheme to isolate a particular class 
of peptides. Such examples include the use of immobilized ir^tal affinity columns (IMAQ 
to enrich phosphopeptides. and the use of Con A beads for isolating glycosylated peptides as 
described, e,g., in Chakraborty et al; and Regnier, 2001 , The 49* ASMS Conference on Mass 
SiJectrometry and Allied Topics, Chicago, Illinois. 

The inverse labeling method is schematically illustrated in Figure 1. In this method, 
each of the two protein pools that are to be differentially compared (e.g„ a control vs. a 
disease state) is divided into two equal portions. One portion from each of the two pook is 
labeled with, e,g., a reagent containing a heavy isotope, e.g., "O, by the above method while 
the remaining portion is not labeled, i.e., labeled with a li^t isotope, eg*, (Figure 1). 
Then a portion from the control and a portion from the perturbed are combined so that in the 
first experiment the labeled proteins are derived from the perturbed pool and, in the second 
experiment, the labeled proteins are derived from the control pooL If expression of a protein 
has been significantly up or down regulated by the perturbation (Le., a true shift in signal 
intensities of ^^O and is observed in one analysis), the inverse should be observed in the 
analysis, of the other sample due to the mvei^e labeling. 
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As depicted in Figure 1, the rapid Identification of differentially expressed proteins is 
achieved via quick identification of peptides derived from those proteins that exhibit the 
characteristic inverse labeling pattern. For most proteins, their expression level remains 
unchanged following perturbation which is reflected by a similar abundance proffle of pool 1 
and pool 2. Therefore, there will be no significant difference in tite labeling pattern between 
the two inverse labeling experiments (i.e*, similar abundance of ^^O- and ^*0-signals in both 
experiments), and these signals can be subtracted but, in principle, by the conqjarative 
analysis of the two data sets. The C-temiinal p^tides without ^*0-labeling are subtr^ted out 
as well. For a protein in which the level of expression has been significantly up or down 
regulated by the perturbation, changes in the ^^O- and ^^O-signal intensities will be observed. 
When the control is not labeled and the perturbed is ^^O-labeled, the ^^O-signal will be of 
greater intensity if the protein is up-regulated; conversely, the ^^0~signal will be stronger if a 
down-regulation has occurred. The inverse will be observed in the second analysis where the 
labeling is reversed. Depending on the direction of the intensity-ratio reversal between the 
two analyses, the direction of differential expression of the protein (i.e., up-regulation or 
down-regulation) can be determined. For example, if a protein is substantially up-regulated 
by a disease state in pool 2 in comparison to the control pool 1, and when the disease sample 
is ^^O-labeled, higher intensities of the ^^Osignals for all peptides from this protein will be 
observed except for the C-terminal peptide. When the labeling is inverted in the second 
experiment in which the control pool is ^^Olabeled while the disease pool is not labeled, tte 
^^O-signals will be stronger for those peptides. Thus, there is a 2/4 Da downward n^ss shift 
of the more intense isotopic ion between the two inverse labeling experiments (ie., from 
*^0-signal in the first experiment to '^O-signal in the second experiment). In reality, for 
peptides of higher masses (e.g., 1300 Da or larger), the mass shift between the two analyses 
on the most intense ion may be detected as 1/3 Da rather than 2/4 Da due to the 
^^C-interference when the protein differential expression is not sufficiently significant to omit 
the "C-effect. The mass shift of the most intense isotopic ion here reflects the intensity-ratio 
reversal. With this procedure, imtead of quantitatively calculating the ratio of the *^0- to 
^*0-signals for every peptide, one only needs to compare the two data sets and identify 
peptides of the characteristic mass shift, which can be achieved rapidly and potentially 
automatically- The direction of the shift implicates either an up- or down-regulation of the 
ejected proteins. 
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In identifying differentially expressed proteins, the inverse labeling approach using 
any suitable labeling method overcomes difficulties inherent in other prior art approaches that 
utilize mass spectrometry as described below. 

Any statistically significant change in protein expression level should display an 
inverse labeling pattern in the inverse labeling experiments. For metabolic ^^N-labeling, the 
mass increase upon labeling is a variable depending on the sequence of the peptides (with a 
range of about LO-1.5% of the peptide MW averaged at about 1.2%). The variable or 
unpredictable mass difference makes it extremely difficult to correlate peptide isotope pairs 
using a conventional mass spectrometer if the spectra are highly complexed. The use of 
ultrahigh resolution FT ICR (fourier transform ion cyclotron resonance) MS has been 
suggested for measurement of high accuracy to obtain accurate mass differences between 
peaks and therefore assign peptide isotopic pairs with high confidence. Another possible but 
impractical solution is through the use of tandem MS. The isotopic pair of peptides should 
possess a similar fragmentation pattern and can thus be correlated using their MS/MS data. 
In the application of the inverse labeling method, what one looks for is the qualitative mass 
shifts, not isotopic pattern, nor accurate mass shifts. Therefore there is no stringent 
requirement on resolving power of the MS instruments. A mass shift is readily recognized 
even though the isotopic peaks may not be fully resolved for peptide ions of higher charge 
states using a standard mass spectrometer of unit resohition. The observation/conclusion is 
further supported by the similar fragmentation pattern of the MS/MS data, which is obtained 
for the logical subsequent step in the process of achieving ti© identification of the proteins. 
Redundant work would have to be carried out using the other solutions, either by measuring 
accurate mass differences of multiple signal pairs to select a best-fit pair, or by performing 
MS/MS on all signals and find a correlated pair based on similarity of £ragmentatk>n pattern. 
The approach of using MS/MS fragmentation pattern for achieving correlation of isotope 
pairs not only requires tremendous amount of instrunKUt time to acquire the data, also 
demands major effort in data handling (impossible to do n^ually)* Difficulties would 
always be present when an isotope signal is too weak for an accurate mass measurement or 
getting a useful MS/MS data. When inverse labeling is not performed, ambiguity is a real 
concern when unpaired (isotope) signals are detected in the cases of protein covalent changes 
or extreme changes in expression. Unpaired signals detected can be confused as unlabeled 
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peptides^roteins or chemical backgrounds* A qualitative shift will be observed with inverse 
labeling if a true change has occurred to a protein quantitatively or qualitatively. With the 
inverse labeling approach, one can use any mass spectrometer of standard unit resolution, and 
acquire only the minimum, essential data to achieve the rapid identification of differentially 
expressed protein markers/targets without ambiguity. Relative quantitation of expression 
level, again only on the differentially expressed proteins (or proteins of interest) can be 
performed afterwards if desired. 

The following examples serve to illustrate the invention but do not to limit the scope 
thereof in any way. 

EXAMPLES 
Materials 

^^O-water (95% atom) is purchased from Isotec Inc. (Mianwbiirg, OH). 

*^C-algal protein extract and ^^C-*^N-algal protein extract are purchased from Isotec 
Inc. (Miamisburg, OH). 

ICAT reagent (both light Do and heavy Dg) is purchased from Applied Biosystems 
(Cambridge, MA). 

Example 1 

Inverse ^^O-LabeMng Utilizing an Eight-Protein Model System 

Q^mmercial proteins of BSA, aldolase, carbonic anhydrase, p-casein, chicken 
albumin, ^o-transferrin, p-lactogk»bulin, and cytochrome C (Sigma) are used without further 
purification. The eight proteins are mixed at amolar ratio of 1:1:1:1:1:1:1:1 for the "controF 
and 0,3:3:1:1:1:1:1:1 for the **treated" pool Two Identical aliquots containing 10 pmol each 
of the unchanged components are taken from each pool and are dried using a Speedvac. The 
^^O-labeling is performed using two procedures, during proteolysis and post-proteolysis. For 
proteolysis labeling, one of the dried aliquots is reconstituted with 20 ^il of regular water and 
the other with 20 \xl of ^^O-water, both containing 50 mM ammonium bicarbonate. Trypsin 
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(Modified, Promega) at a 1:100 trypsin-to-protein ratio (wt:wt) is added to each solution and 
digestion is allowed to proceed at SV^'C for -20 hrs. For the post-proteolysis labeling, all 
trypsin digestions are performed in regular water-ammonium bicarbonate buffer at the same 
trypsin to protein ratio for --12 hrs. The resulting peptide mixtures are then taken to conqjlete 
dryness with a Speedvac. 10 jliI of ^^O or regular water are added respectively to the dried 
peptide mixtures for post-proteolysis ^^O-labeling* The process is allowed to proceed at room 
temperature for -12 hrs. Prior to analysis, for both during-proteolysis and post-proteolysis 
labeling, the ^^O-control sample is mixed with the *^0-**treated" san:5>le and the ^^Ocontrol 
sample is mixed with the ^®0-'*treated" san^le. The same MS analysis is performed on both 
mixtures. 

Example 2 

Inyerse ^^O-LabeKng UtiBbring Whole Cell Lysate Spiked Wifli FTP (Protein Tyrosfaie 
Phosphatase) 

Approximately 5 x 10^ harvested CHO cells are lysed mechanically (freeze^aw) 
using a buffer containing 10 mM Tris, 1 mM EDTA, pH 7.4. The resulting cell lysate of 
2.5 ml at 0.4 mg/ml protein concentration is divided into four aliquots. Two are ^Hced with 
10 pmol of PTP-IB protein (internally expressed, residue 1-298) OPTPIO) and the other two 
with 30 pmol of FTP-IB (PTP30), Trypshi is added to each solution at a 1:100 (wtrwt) 
trypsin-to-total protein ratk> to initiate the digestion. The proteolysis is allowed to proceed at 
37°C for --12 hrs. The resulting solutions are centrifuged and the solid discarded. The 
solutions are then taken to complete dryness with a Speedvac. For both FTP 10 and PTP30» 
one of the two identical aliquots is reconstituted with 10 fil of ^^O-water, the other with 10 jil 
of regular water. The post-proteolysis *^0-incorporation is allowed to proceed at room 
temperature for -12 hrs. Prior to analysis, the ^^OPTPIO and ^*OPTP30 samples are mixed, 
and so are the ^^O-PTPIO and ^^O-PTPSO samples. E^h mixture is diluted widi 100 jiil of 
mobile phase A (0,1% formic acid - 0.01% TFA in water) and filtered through a 0.4 jim 
Microcon filter. The ISltrate is injected to LC/MS for analysis. 
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Example 3 

LC/MS aBd LC/MS/MS Peptide Analysis of Inverse ^®0-LabeIed Peptide Mixtures 

MS analysis of the inverse *^0-labeled peptide mixtures is carried out throu^ LC-BSI 
MS using a Finnigan LCQ ion trap mass spectrometer- A 1 .0 x 150 mm Vydac CI 8 column 
is employed for on-line peptide separation with a gradient of 2-2-20-45-98-98% B at 0-2-10- 
65-66-70 min. The mobile phase A is 0.1% formic acid - 0.01% TFA in water and B is 0.1% 
formic acid- 0,01% TFA in acetonitrile. The flow rate is 50 ^1/min. Post-LC column, the 
flow is split 9:1 with about 5 jil/min going into MS and 45 jd/min being collected for kter 
use. LCQ ion trap mass spectrometer is operated at a data-dependent mode automatically 
performing MS/MS on the most intense ion of each scan when the signal intensity exceeds a 
pre-set threshold. When needed, the collected samples are concentrated and re-analyzed to 
ol^mx MS/MS data that are not collected automatically in the first run for the peptides of 
interest* The relative collision energy is set at 45%. Under this condition, most peptides 
fragment effectively hi our experience. An 8-Da window for precursor ion selection is 
enqjloyed, 

Eixample 4 

MALDI TOF MS Peptide Analysis of Inverse ^*0«Labeled Peptide Mixtures 

The mixture samples are simply diluted 1:3 to 1 :5 using the MALDI matrix solution 
(saturated a-cyano-4-hydroxy cinnamic acid in 50% acetonitrile - 0,1% TFA) and -1 jil of 
the final solution (containing about 500 ftnol each based on the unchanged components for 
the eight-protein system) are loaded onto MALDI target for analysis. The analysis is 
performed on a Broker REFLEX HI MALDI TOF mass spectrometer operated in the 
reflectron mode with delayed ion extraction. When applicable, post source decay (PSD) is 
also performed on the peptide ions of interest. 
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Example 5 

Database Search of Inverse ^*0-Labeled Peptides 

Search software PROWL (Proteometrics, New York, NY) and MASCOT (Matrix 
Science, London, UK) are used to search protein databases to identify proteim using peptide 
fingerprints, MS/MS fragments, and processed PSD spectra. For searches using peptide 
fingerprint information, peptide ions exhibiting the inverse labeling pattern or mass shift of 2 
or 4 Da on the most abundant isotopic ion between the two inverse labeling experiments are 
sorted out based on the direction of mass shift (up or down). Each Hst is used separately for a 
database search to identify the proteins. For searches using peptide sequence information, the 
MS/MS spectra of a peptide from the two inverse labeling e3q}eriments are compared and Y 
ions with a mass shift of 2 or 4 Da are identified^ These ions are used alone or in 
combination with B ions to search protein databases to obtain identification of the proteins. 
An iterative search combining the data of the peptide map and MS/MS is also performed. 
Any ionjs that demK>nstrate a clear inverse labeling pattern in the map and are supported by 
mass shifts of fragment ions in MS/MS data are identified first using their MS/MS 
fragments/sequence tags* The peptides associated with the identified proteins are then 
reirK>ved from the list and a second round search is initiated using the masses of the 
remaining peptides. For the ions for which no convincing conclusion could be made, a 
second analysis using the collected sample is performed to obtain MS/MS data on them Hie 
resulting data are used in the same manner to search the databases for protein identification. 

Example 6 

MS Analysis of Inverse ^*0«-Labellng Method Using the Eight-Protein Model System 

The inverse ^^O-labeling and MS analysis are performed in a similar fashion as shown in 
Figure 1 on the eight-protein model system where BS A is "down-regulated" by 3-fold and 
aldolase 'nip-regulated'* by 3-fold. When analyzed using an LCQ with on-line RP LC, a clear 
inverse labeling pattern or a 2/4 Da mass shift is observed for a number of peptides 
(TFigures 2-3 (A, B)). Following data analysis, two lists of peptide masses that are based on 
the direction of the mass shift are quickly formed. When each is used separately to search the 
database, aldolase is exclusively identified using the list of 2/4 Da downward shift, 
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corresponding to an up-regulation of protein expression, while BSA is identified using the list 
of upward mass shift, which corresponds to a down»regulation in protein expression. MS/MS 
spectra are obtained automatically at data dependent mode on a few of the peptides. An 
iterative search scheme is also applied, using the combmed mass list of all that shifted, 
regardless of the direction of the shift- Once a protein is identified with high confidence 
(aldolase in this case), with either the mass Hst or an MS/MS spectrum, the related peptides of 
the piotein are removed from the mass list. A second search is then performed on the 
remaining list to identify the second most prominent protein (BSA in this case). As a 
consequence of inverse labeling, very rich information is embedded in the MS/MS data. 
First, smce the label is incorporated at the C-terminus of each peptide, Y ions in an MS/MS 
spectrum are the fragments carrying the label and exhibit the characteristic inverse labeling 
pattern for proteins that are diffemntially expressed. As shown in Figures 2-3 (C, D), for 
proteins whose "expression level" is significantly altered by '^perturbation", the inverse 
labeling pattern or a 2/4 Da mass shift observed at the molecular ion level on the peptides is 
passed on to the Y ions in the MS/MS spectra. The observation of the characteristic inverse 
labeling pattern on the fragment ions in the MS/MS spectra provides further verification and 
confirmation of protein differential expression. Since most peptide fragments carry fewer 
charges than the parent molecule (mostly singly charged in the figures shown in this paper), 
the mass shift is more prominent and thus is easier to recognize compared to that from their 
multiply charged precursor ion. Secondly, the inverse labeling pattern that is reflected in 
Y ions in the MS/MS spectra, in turn, offers a very convenient way to identify Y ions and 
B ions for the interpretation of an MS/MS spectmm. The fragments with mass shifts are Y 
and Y-related ions and the ones without mass shift are B or Bkrelated ions. Although 
iaterpretation is not required to search the databases using MS/MS data, added specificity 
helps to increase efficiency and accuracy of protein identification via database search. Both 
BSA and aldolase are positively identified using the MS/MS data and the Y/B ion 
assignments (Figures 2-3). In fact, all expected proteins are identified using the MS/MS data 
and the Y/B ion assignments (Figures 2-3 and 5-6). These advantages are of more 
iiqportance when one deals with novel proteins where de novo sequencing is required. The 
ability to assign Y and B ions greatly f aciUtates "read ouf * of the sequence from an MS/MS 
spectrum. Although accurate quantitation of protein expr^ion is not the intended use of the 
metibLod, the information is available in both MS and MS/MS data, if one desires to perform 
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the task (i.e., signal intensities of ^^O to ^^O after correction of the natural ^^C-isotopic 
contribution), MALDI TOF MS performed directly on the mixture without any separation 
results in a peptide-map spectrum that shows severe overlap, which makes data interpretation 
difficult (Figure 4 (A, B)). Nonetheless, the inverse labeling pattern can still be observed for 
a number of ions (Figure 4 (C, D)). PSD is carried out on a few of the ions and the proteins 
are able to be identified using the PSD data (Figure 5). 

Example 7 

MS Analysis of Inverse "O-Labeling Using PTP-Spiked Cell Lysate System 

On the whole cell lysate system where PTP-IB protein is spiked in at two different 
levels with the intention to mimic a complex protein mixture system, a lot of peptide signals 
with good signal intensities are detected (data not shown). Even with on-line LC separation, 
severe overlapping is expected and, indeed, observed. Nonetheless, when the two sets of data 
from inverse labeling are analyzed and compared, a few ions are identified with the 
characteristic inverse labeling pattern, primarily with a 4 Da shift (Figure 6 (A, B)). The split 
and collected san^les are subjected to a second round of analysis to obtain their MS/MS data. 
The MS/MS data with Y ions exhibiting the inverse labeling pattem of a 4 Da shift between 
the two parallel experiments further verify/ confirm the mass shift observed on the precui^r 
peptides and, thus, the differential expression of the protein (Figure 6 (C, D)), A database 
search using the readily recognized Y ions of mass shift leads to the conclusive identification 
of the protein as human PTP-IB. In this partk?alar case with whole cell lysate, as expected, 
MALDI MS peptide mapping does not provide much useful information due to severe 
overlapping of the peptide signals (data not shown). 

Unlike metabolic labelmg of proteins during cell culture (*^C/^^/^HO, this sqpproach 
doesn't require any ^Mial skill and/or facility. Also, analysis of tissue proteins and 
identification of marker/target proteins from tissues can be readily performed. Unlike 
chemical labeling, this method does not involve addMonal reaction/work-up steps. Thus, it 
avoids potential sample loss associated with the additional steps. Another pitfall associated 
with the residue-specific chemical labeling, namely, high likelihood of losing post- 
translational modification information, is also avoided. Because two collaborative analyses 
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are peifornied with the inverse labeling metihod, signals of no isotopic counterpart detection 
either due to extrenie changes in expression level and the dynamic range limitation of MS 
detection or covalent modifications of proteins can identified without ambiguity. 

Example 8 

Inverse ^^N-I^beling Utilizing a Two-Protein Model System 

Regular and *%-labeled PTP protein (1-298) and regular and ^^N-labeled HtrA 
protein (161-373) are internally prepared using standard culture conditions with the 
*^N-labeled materials being produced by fermentation in ^^N-enriched culture media. The 
authenticity of the proteins and the level of isotope incorporation are assessed by MS on the 
final protein products. The labeling yield is better than 90% for both proteins according to 
MS results. The two-protein model systems are made by mixing together the two individual 
proteins, PTP and HtrA, with the regular ^''N-mixture being the nrixture of the two 
^^-pioteins, and the *^N-mixture as the mixture of the two ^^N-labeled proteins. The 
"control" is a mixture of two proteins at a nK>lar ratio of 1:1. The *1:reated** or "altered state** 
materials are made to naimic four different levels of **protein differential expression" for PTP 
protein while the level of "expression" of HtrA remains unchanged* The molar ratios of 
PTP:HtrA for the four "treated" nrixtures are 3:1, 100:1, 0.3:1, 0.01:1 mimicking a 3-fold and 
a 100-fold up-regulation and a 3-fold and a 100-fold down-regulation, respectively. The 
regular ^''N-mixtures and the labeled ^^-mixtures are made in the same mamier. To perform 
the inverse labeling e3q>eriments, an aliquot of *^N-control is mixed with an aliquot of 
*^-**treated" (each containing the san» amount of HtrA protein) while the inverse labeling fe 
achieved by combining the "^-control with the *'*N-**treated" in the same fashion. (Two 
inverse labeling mixtures are thus produced for each con?)arative proteomic experiment) 
The same procedure is performed for all four "differencial" levels. The subsequent trypsin 
digestion is carried out on all the mixtures at a 1:50 trypsin-to-protein ratio (wt:wt) (Modified 
trypsin from Promega, sequencing grade) at 37*'C for -7 hrs in 50 mM ammonium 
bicarbonate buffer (the two proteins are known to readily digest under this condition without 
prior reduction and alkylation). MS analysis using both MALDI and electrospray LC/MS is 
performed on all peptide mixtures. Allquots each containing 10 prool of HtrA peptides are 
used for the LC/MS analysis. 
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Example 9 

Inverse ^^N-Labeling Utilizing Algal CeH Lysate Spiked With FTP Protein 

A 1 ml solution containing 6M Gnanidine HCl-50 mM Tris-50mM NaQ pH 7.4 is 
added to 10 mg each of a ^^C-algal protein extract and a ^^C-^^N-algal protein extract The 
mixtures are vortexed and sonicated for 40 min to sohibilize the proteins. After centrifuge at 
20,000 RPM for 20 min, the supematants are taken out for further use. A large amount of 
insoluble is discarded. 10 mM DTT is added to the solutions and reduction reaction 
continues for 1 hr at 50*^0, Cysteine alkylation is carded out by the addition of 40 mM 
iodoacetic acid sodium salt followed by shaking at room teroperature in the dark for 1 hr. A 
Centricon filter of IkDa MW cutoff is subsequently used to remove the excess reagents and 
to exchange the buffer to 50 mM ammonium bicarbonate. Protein concentration of the 
extracts is measured using the standard Bradford method. 10 pmol of regular FTP protein is 
spiked mto an aliquot of ^^C-algal protem extract containing about 0.05 mg of total protein to 
form the ^^-"contror, and 10 pmol of *^N-PTP is spiked into an aliquot of "C-^^-algal 
protein extract containing about 0.05 mg of total protein as the ^^-"contror. As for the 
"treated'% a 3-fold down-regulation is created by spiking 3 pmol of FTP into an identical 
aliquot of algal extract, and a 100-fold down-regulation is made by spiking 0.1 pmol FTP into 
another equal aliquot of algal extract* The ^'^N-material is the result of ^"^-PTP being spiked 
into the aliquot of "C-algal extract, and, the ^^N-material is produced by spiking '^-PTP 
into aliquot of ^^C-^^N-algal extract. The inverse labeling experhnents proceed in the same 
way by combining aliquots of ^''N-control with ^^N-'treated'% and ^^N-control with 
^'^N-'^treated". Trypsin digestion on the four resulting inverse labeling mixtures (for two 
differential levels) is performed at a 1 : 100 trypsin-to-protein ratio (wtrwt) at 3TC for --1 6 hrs 
in 50 mM ammonium bicarbonate buffer. All digests are analyzed by electrospray LCTMS. 

Example 10 

MALDI TOF MS Peptide Analy^s of The Inverse^^N-Labeled Peptide Mixtures 

All digest mixtures of the two-protem model systems are analy2»d by MALDI TOF 
MS. The mixture samples are diluted 1:5 using the MALDI matrix solution (saturated 
a-cyano-4-hydroxy cinnamic acid in 50% acetonitrile - 0.1% TFA) and -1 jil of each of the 
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final solutions (containing about 500 ftnol of HtrA peptides) is loaded onto MALDI target for 
analysis. The analysis is performed on a Bruker REFLEX III MALDI TOF mass 
spectrometer operated in the reflectron mode with delayed ion extr^on. 

Ebgample 11 

LC/MS And LCTMS/MS Peptide Analyses of The Inverse ^N-Labeled Peptide Mixtures 

All digest mixtures of the two-protein model systems and those jfrom the algal-spiking 
systems are analyzed by LC/MS-MS/MS. The analysis is carried out through electrospray 
LC/MS using a Finnigan LCQ ion trap mass spectronD^ter. A LO x 150 mm Vydac CIS 
column is employed for on-line peptide separation. A gradient program of 2-20-45-98- 
89%% B at 0-10-65-66-70 min is used. Mobile phase A is 0.25% formic acid in water and 
mobile phase B is 0.25% formic acid in acetonitrile. The flow rate is 50 ixVmm. After the 
ehition from the LC column, the flow is split 9: 1 with about 5 |il/rain going into MS and 
45 }il/min being collected for later use. The LCQ ion trap mass spectrometer is operated at a 
data-dependent mode, automatically performing MS/MS on the most intense K>n of each scan 
when the signal intensity exceeds a pre-set threshold. When needed, the collected sait^les 
are concentrated and re-analyzed to obtain MS/MS data that are not collected automatically 
in the first ran for the peptides of interest. The relative collision energy is set at 45% at 
which most peptides fragment effectively. A 5-Da window for precursor ion selection is 
employed. 

ExamT>le 12 

Database Search of The Inverse^^N-Labdled Peptides 

Search software PROWL (Proteometrics, New York, NY) and MASCOT (Matrix 
Science, London, UK) are used to search the protein databases to identify proteins using 
peptide fingerprints, and MS/MS fragments. For searches using peptide fingerprint 
information, peptide ions exhibiting the inverse labeling pattern between the two inverse 
labeling experiments are sorted out based on the direction of mass shift (increasing or 
decreasing). Each list is used separately for a database search to identify the proteins. For 
searches using peptide sequence information, the MS/MS spectra of a peptide from the two 
inverse labeling experiments are con^ared and their correlation is further verified/confirmed 
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by their similar fragmentation pattern. The MS/MS spectrum of the -peptide (lower in 
mass) is used to search databases for protein identification. 

Example 13 

MS Analysis of Inverse ^^N-Labeling Method Using the Two-Protein Model System 

Direct MALDI analysis is successfully carried out on the mixtures of the two-protein 
model system. Off-line coupling of separation (such as with two-dimensional 
chromatography) with MALDI TOF MS on a digest of a coniplexed protein mixture (e.g., 
total cell lysate) can in each fraction resemble the situation demonstrated here. Li contrast to 
the inverse labeling method, when the single-experiment approach is applied, even for the 
cases where protein differential expression is not so drastic that both isotope pairs are clearly 
detected (e.g., 3-fold change. Figure 7 (A)), correlation of isotopic pairs can stOl be difficult 
to achieve such as that shown in the m/z range of 1550-1600. However, by subtractive 
comparison of two IVlALDI spectra from an inverse labeling experiment (Figures 7-8 (A, B)), 
signal pairs from proteins of no significant differential expression can be subtracted out (such 
as those marked with arrows along the horizontal axis) and result in much simplified spectra 
for easier correlation. When protein differential expression is not too drastic (e.g,, 1000-fold 
or less) and both isotope signals are detected, the reversal in signal intensity ratio is easily 
recognized to support the correlation (Figure 7 (A, B)). Mistakes are more likely to happen, 
if inverse labeling is not used, in correlating isotopic pairs when a more dramatic differential 
e>ipression has occurred such that the weaker isotopic signals are not detected due to the 
dynamic range limitation in MS detection. Falling into the same category is covalent change 
of protein as a result of a perturbation where covalent modifications of proteins occur such as 
protein processing at terminus or post-translational modifications. The peptides bearing the 
covalent changes will be detected without the isotopic counterpart since the modifications are 
not present m the control state. Inverse labeling offers an easy solution to these problems. 
Although a lOO-fold down regulation is not drastic enough for the weaker isotope signals to 
conapletely escape detection, it is a good example to demonstrate the benefits of the approach. 
As shown in Figure 8 (A, B), the inverse labeling pattern is readily recognized after the 
subtractive cleanup of signals from proteins of no significant differential expression. 
(Keeping in mind that the range of nitrogen atoms per peptide sequence shonld normally be 
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larger than 1% of the peptide molecular weight and smaller than 1,5% of the peptide MW, 
and averaged at about L2% MW.) The digestion mixtures from the two-protein model 
systems are also analyzed by electrospray LC/MS (Figure 9 (a-c)). The data suggest that the 
isotopic pairs do not display any significant separation by reverse phase chromatography. A 
quick comparison of the two base-peak ion chromatograms from an inverse labeling 
experiment (Figar& 9 (a)) leads to the rapid identification of the base-peak peptides of inverse 
labeling pattern (mass shifts) or from proteins of diffetential expression* Certainly, one has 
to process the MS data in order to identify other peptides of inverse lateling pattern that are 
in lower abundance and co-ehiting with more abundant peptides. Once the peptide signals 
with inverse labeling pattern are identified, the MS/MS data that are. acquired automatically 
in data-dependent mode of operation are analyzed. Their similar fragmentation pattern would 
verify/confirm the correlation of isotopic pairs and thus the correct conclusion on protein 
differential expression. The data are then used to search protein databases for protein 
identification (Figure 9 (c)). In this case, PTP-IB protein is readily identified from the 
database. In practice, when dealing with a complexed protein system, an iterative search 
scheme combining the data of ions with inverse labeling pattern from peptide map and 
MS/MS may be performed. Any ions that demonstrated a clear inverse labeling pattern in the 
map and are ftirther supported by similar fragmentation patterns of MS/MS data are identified 
first using their MS/MS data (of ^'^N-ion or lower mass). The peptides associated with the 
identified proteins can then be removed from the peptide list and a second round search is 
initiated using the MS/MS data of the remaining peptides of inverse labeling patt^. For 
those ions of no MS/MS data automatically acquired, a second analysis is performed using 
the collected sanople to obtain their MS/MS data. The data are then used in the same nmnner 
to search the databases for protein identification. 

Example 14 

MS Analysis of Inverse ^N-Labeling MeOiod Using the Spiked Algal Cell Lysate System 

To demonstrate the application of the approach in a more complexed mixture, 
PTP-IB protein, both non-labeled and ^^-labeled, are spiked into algal cell lysate - ^^C and - 
^^C/^^N, respectively, at different levels (3--fold and lOO-fold down-regulation) to mimic 
protein differential expression. The inverse labeling experiment is then performed and the 
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mixtures are analyzed by LC/MS-MS/MS. When two sets of data from each inverse labeling 
experiment are compared, a number of ions possessing the characteristic inverse labeling 
mass shifts are extracted (Figure 10 (A, B)). The split and collected samples are subjected to 
a second analysis to obtain MS/MS on the ions that exhibit the inverse labeling pattern. Their 
similar fragmentation patterns clearly validates the mass shift or inverse labelmg pattern 
observed on the precursor peptides and, thus, the differential e^qpression of the precursor 
protein (Figure 10 (C, D)). A database search using the MS/MS data of '^-peptide leads to 
the exclusive identification of the human PTP-IB protein. 

Example 15 

Inverse ICAT Labeling Utilizing a Sfx-Protein Model System 

Commercial proteins of BSA, aldolase, p-casein, apo-transferrin, p-Iactoglobulin, and 
cytochrome C (Sigma) are used without further purification. The six proteins are mixed at a 
molar ratio of 1:1:1:1:1:1 for the "control" and 0.3:3:1:1:1:1 for the "treated" pooL The 
recommended protocol is followed. The protein mixtures of control and "treated'* are first 
reduced and denatured. ICAT derivatization is then performed in the inverse labeling way 
(Figure 1), with half of each mixture reacting with Do -ICAT reagent and the remaining half 
reacting with Dg-ICAT reagent. The inverse labeling proceeds by mixing the Do-control with 
the Ds-'*treated", and the Dg-control with the Do-"treated". Trypsin digestion is then 
performed on both mixtures at 1:50 (wt:wt) trypsin-to-protein ratio for --16 hrs at 37*'C. The 
resultant peptide mixtures first go through a cation exchange step for cleaning up the excess 
reagents, denaturant, and reducing agent, etc. They then go through an avidin column for 
affinity enriclnnent of the labeled (cysteine-containing) peptides. Aliquots containing 
10 pmol each of the unchanged components are taken from each pool and are dried using a 
Speedvac* They are reconstituted with mobile phase A prior to LC/MS and M ALDI TOP MS 
analysis- 

Kxamnle 16 

LC/MS And LC/MS/MS Peptide Analyses of diverse ICAT-Labeled Peptide Mixtures 

MS analysis of the ICAT labeled peptide mixtures (see Example 15) is carried out as 
set forth in Example 3 except that a S-Da window for precursor ion selection is employed. 
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Example 17 

MALDI TOF MS Peptide Analysis of biverse ICAT-Labeled Peptide Mixtares 

Aliquots of the Speedvac dried mixture samples from Example 15 are subjected to the 
same procedure as set forth in Example 4. 

Example 18 

Database Search of Inyerse ICAT-Labeled Peptides 

Search software PROWL (Proteometrics, New York, NY) and MASCOT (Matrix 
Science, London, UK) are used to search the protein databases to identify proteins using 
peptide fingerprints and MS/MS fragments* For searches using peptide fingerprint 
information, peptide ions exhibiting the inverse labeling pattem of mass shifts between the 
two inverse labeling experiments are sorted out based on the direction of mass shift 
(increasing or decreasing). Each list is used separately for a database search to identify the 
proteins. An iterative search combining the data of ions with inverse labeling pattem from 
peptide map and MS/MS is also performed. Any ions that demonstrate a clear invert 
labeling pattem in the map and are further supported in MS/MS data by their similar 
fragmentation pattem and fragments with and without mass shifts are identified first using 
their MS/MS fragments. The peptides associated with the identified proteins are then 
removed from the list and a second round search is initiated usmg the masses of the 
remaining peptides of inverse labeling pattem. For those ions for which no convincing 
conclusion can be made, a second analysis is performed using the collected sandple to obtain 
MS/MS data. The resulting data are used in the same mann^ to search the databases for 
protein identification. 

Example 19 

MS Analysis of InveKe ICAT Labeling Method Using the Six-Protein Model System 

The inverse labeling and MS analysis are performed in the same manner as shown in 
Figure 1 on the six-protein model system where BS A is **down-'reguIated*' by 3-fold and 
aldolase ^^up-regulated" by 3-fold. MALDI TOF MS which is performed dfrectly on mixture 
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without any separation, while displaying a large degree of signal overlap, still clearly 
demonstrates how the inverse labeling strategy helps to quickly identify the peptide signals 
derived from proteins of differential expression. Without the inverse labeling strategy one 
would have to evaluate a single spectrum (e.g.. Figure 11 (A)) looking for the ± 8/16/24-Da 
pair for each and every peptide and performing quantitation. Utilizing the inverse labeling 
strategy one only needs to overlay the two spectra (Figure 1 1 (A, B)) and perform "zoom and 
pick" to identify the peaks that show the characteristic mass shift between the two spectra. 
Very quickly (a few minutes in this case) after this exercise of qualitative comparison, the 
peaks of the characteristic inverse labeling pattern are identified (e.g., mass labeled peaks)- It 
is apparent that when applying inverse labeling, a quick qualitative conparison of the two 
data sets can lead to the quick identification of the peptides of interest. Quantitation and PSD 
or MS/MS analysis for protein identification can then be perfomied on those peptides. When 
the same samples are analyzed using an LCQ with on-line RP LC, the characteristic inverse 
labeling pattern of mass shift is also clearly observed on a number of peptides (Fig«r© 12 (A, 
B)). The mass shifts vary depending on the number of cysteines in the sequence and the 
charge state of the peptide being detected. Following data analysis, two lists of peptide 
masses are quickly generated that are based on the direction of the mass shift. These two lists 
are used to search the database. Aldolase is exclusively identifled using the list of decrease in 
mass shift, corresponding to an up-regulation of protein expression. BSA is identified using 
the list of increase in mass shift, corresponding to a down-regulation in protein e?q)ression. 
MS/MS spectra are obtained automatically in data-dependent mode for a number of the 
peptides* In order to emulate a broad-spectrum situation where multiple proteins may be up- 
or down-regulated, an iterative search scheme is also applied. In this case we use the 
combined miass list of all the peptides that show a mass shift, regardless of the direction of the 
shift. After a protein is identified with high confidence using either the mass list or an 
MS/MS spectrum (aldolase in our system), all peptides derived from the protein are removed 
from the mass list. The process is then repeated in order to identify the next protein 
displaying the mass shift (BSA in this case). It should be pointed out that there are additional 
information embedded in the MS and MS/MS data. The mass shifts indicate how many 
cysteins are present in a sequence. When used for database search, this added specificity 
helps to narrow down the candidate Hst and increase the efficiency and accuracy of the search 
results. 
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It will be und^tood that various modifications may be made to the embodiments 
and/or examples disclosed herein. Thus, the above description should not be construed as 
limiting, but merely as exenaplifications of preferred embodiments. Those skilled in the art 
will envision other modifications within the scope and spirit of the claims appended hereto. 
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What is claimed: 

1 . A method for identifying a differentially expressed pixjtein in two different samples 
containing a population of proteins comprising: 

a) providing two equal protein pools from each of a reference sample and an 
experimental sample; 

b) labeling the protein pools with a substantially chemically identical isotopically 
different protein labeling reagent for proteins, wherein one pool from each of the 
reference and experimental pools is labeled with an isotopically heavy protein 
labeling reagent to provide an isotopically heavy-labeled reference pool and an 
Isotopically heavy-labeled experimental pool, and wherein the remaining 
reference and experimental pools are labeled with an isotopically light protein 
labeling reagent to provide an isotopically light-labeled reference pool and an 
isotopically light-labeled experimental pool; 

c) combining the isotopically light-labeled reference pool with the isotopically 
heavy-labeled experimental pool to provide a first protein mixture; 

d) combining the isotopically heavy-labeled refia^nce pool with the isotopically 
light-labeled experimental pool to provide a second protein mixture; 

e) detecting the labeled proteins from each of the two mixtures; and 

f) comparing the labeling pattern obtained for the labeled proteins in the first and 
second mixtures, wherein an inverse labeling pattern of a protein in the second 
mixture compared with the labeling pattern of the protein in the first xnixture is 
indicative of the differentially expressed protein in the two different samples. 

2. The method of claim 1, which further comprises enzymatically or chemically cleaving 
the labeled proteins in the first and second mixtures to provide peptide mixtures prior to 
step (e). 

3. The method of claim 2, which further comprises sequencing one of the peptides to 
identify tibe differentially expressed protein from which the peptide originated. 
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4. The method of claim 3, wherein sequencing of the peptide is performed utilizing 
tandem mass spectrometry or post source decay (PSD). 

5. The method of claim 1 , which further comprises sequencing the differentially 
expressed protein to identify the protein, 

6. The method of cl^ 5, wherein sequencing of the differentially e^^ressed protein is 
performed utilizing tandem mass ^ectrometry or FSD. 

7. The method of claim 1, which further comprises separating the labeled proteins from 
each of the first and second mixtures prior to step (e), 

8. The method of claim 7, wherein the step of separating the labeled proteins from the 
two mixtures is carried out using a technique selected from the group consisting of 
ammonium sulfate precipitation, isoelectric focusing, size exclusion chromatography, ion 
exchange chromatography, adsorption chromatography, reverse phase chromatography, 
affinity chromatography, ultrafiltration, immunoprecipitation and combinations thereof. 

9. The method of claim 2, which further comprises separating the labeled peptides from 
each of the first and second mixtures prfor to step (e). 

10. The method of claim 9, wherein the step of separating the labeled peptides from the 
two mixtures is carried out using a technique selected from the group consisting of size 
exclusion chromatography, ion exchange chromatography, adsorption chromatography, 
reverse phase chromatography, affinity chromatography, immunoprecipitation and 
combinations thereof. 

1 1 . The method of claim 1 , wherein the labeled proteins are detected by mass 
spectrometry. 

12. The method of claim 2, wherein the labeled peptides are detected by noass 
spectrometry. 
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13. The method of claim 1 , which farther comprises subjecting the samples to at least one 
fractionation technique to reduce the complexity of proteins in the samples prior to step (a). 

14. The method of claim 2, which further comprises subjecting the isotopically labeled 
proteins of the first and second mixtures to at least one fractionation technique to reduce the 
complexity of proteins in the first and second mixtures prior to cleaving the labeled proteins 
in the first and second mixtures. 

15. The method of claim 13, wherein the fractionation technique is selected from the 
group consisting of ammonium sulfate precipitation, isoelectric focusing, size exclusion 
chromatography, ion exchange chromatography, adsorption chromatography, reverse phase 
chromatography, affinity chromatography, ultrafiltration, inmiunoprecipitation and 
combinations thereof, 

16. The method of claim 1, wherein the two samples differ in cell type, tissue type, 
physiological state, disease state, developmental stage, environmental conditions, nutritional 
conditions, chemical stimuli or physical stimuli 

17. The method of claim 1, wherein the isotopically heavy protein labeling reagent 
contains a stable heavy isotope selected from the group consisting of ^H, ^"^C, ^^N, ^'O, '^O 
and^S. 

18. The method of claim 1, wherein the isotopically light protein labeling reagent 
contains a stable light isotope selected from the group consisting of H, ^^C, ^"^N, ^^O and ^^S. 

19. The method of claim 1, wherein the isotopically heavy protein labeling reagent 
contains and the isotopically light protein labeling reagent contains ^^O. 

20. The method of claim 1 , wherein the protein labeling reagent contains an affinity tag. 
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21. The method of claim 1, wherein the samples are selected from the group consisting of 
cell homogenates, cell fractions, tissue homogenates, biological fluids, tears, feces, saliva and 
lavage fluids. 

22. The method of claim 1, wherein the differentially expressed protein is selected from 
the group consisting of cell surface proteins, membrane proteins, cytosolic proteins and 
organelle proteins, 

23. A method for identifying a differentially expressed protein in two different samples 
containing a population of proteins comprising: 

a) providing two equal protein pools from each of a reference sample and an 
experimental sample; 

b) proteolyzing each protein pool during labeling of each of the protein pools with 
isotopically labeled water, wherein one pool from each of the ref^ence and 
experimental pools is labeled with *^0-water to provide an ^*0-labeled reference 
pool and an **0-labeled experimental pool, and wherein the remaining reference 
and experimental pools are labeled with ^^O- water to provide an ^^O-labeled 
reference pool and an *^0-labeled experimental pool; 

c) combining the ^^O-labeled reference pool with the ^^OJabeled experimental pool 
to provide a first mixture containing ^^O- and "O-labeled peptides; 

d) combining the ^^O labeled reference pool with the ^^Olabeled experimental pool 
to provide a second mixture containing ^^O and ^^O-labeled peptides; 

e) detecting the labeled peptkles from each of the two mixtures; and 

f) con^aring the labeling pattern obtained for the labeled peptides in the first and 
second noixtures, wherein an inverse labeling pattern obtained for a peptide in the 
second mixture compared with the labeling pattern obtained for the p^tide in the 
first mixture is indicative of the differentially e3q>ressed protein from which the 
peptide originated. 
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24. The method of claim 23, which further comprises separating the labeled peptides in 
the two mixtures prior to step (e). 

25. The method of claim 24, wherein the step of separating the labeled peptides in the two 
mixtures is carried out using a technique selected from the group consisting of size exclusion 
chromatography, ion exchange chromatography, adsorption chromatography, reverse phase 
chromatography, affinity chromatography, immunoprecipitation and combinations thereof. 

26. The method of claim 23, wherein detection of the labeled peptides is carried out by 
mass spectrometry. 

27. The method of claim 23, which further comprises sequencing one of the peptides to 
identify the differentially expressed protein from which the peptide originated. 

28. The method of claim 27, wherein sequencing of the peptide is performed utilizing 
tandem mass spectrometry or PSD, 

29. The method of claim 23, which further cornprises subjecting the samples to at least 
one fractionation technique to reduce the complexity of proteins in the samples prior to 
step (a). 

30. The method of claim 23, which further comprises subjecting the labeled peptides of 
the first and second mixtures to at least one fractionation technique to separate undesirable 
peptides from the first and second mixtures prior to step (e), 

3 1 . The method of claim 29, wherein the fractionation technique is selected from the 
group consisting of ammonium sulfate precipitation, isoelectric focusing, size exclusion 
chromatography, ion exchange chromatography, adsorption chromatography, reverse phase 
chromatography, affinity chromatography, ultrafiltration, immunoprecipitation and 
combinations thereof. 
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32. The method of claim 23, wherein the samples are selected from the group consMing 
of cell homogenates, cell fractions, tissue homogenates, biological fluids, tears, feces, saliva 
and lavage fluids. 

33. The method of claim 23, wherein the differentially expressed protein is selected from 
the group consisting of cell surface proteins, membrane proteins, cytosolic proteins and 
organelle proteins. 

34. The method of claim 23, wherein the two samples differ in cell type, tissue type, 
physiological state, disease state, developmental stage, physiological state, environmental 
conditions, nutritional conditions, chemical stimuli or physical stimculi. 

35. A method for identifying a differentially expressed protein in two different sanxples 
containing a population of proteins comprising: 

a) providing two equal protein pools from each of a reference sample and an 
experimental sample; 

b) proteolyzing the proteins in each of the protein pools to provide peptide pools; 

c) labeling each peptide pool with isotopically labeled water, wherein one peptide 
pool from each of the reference and experimental pools is labeled with ^^O-water 
to provide an ^^O-labeled reference peptide pool and an ^®04abeled experimental 
peptide pool, and wherein the remaining reference and experimental peptide pools 
are labeled with ^^O-water to provide an ^^O-labeled reference peptide pool and an 
*^0-labeled experimental peptide pool; 

d) combining the ^^Olabeled reference pool with the ^^O-labeled experimental pool 
to provide a first mixture containing and **0-labeled peptides; 

e) combining the ^^Olabeled reference pool with the ^^O-labeled experimental pool 
to provide a second mixture containing ^^O- and ^^O-labeled peptides; 

f) detecting the labeled peptides from each of the two mixtures; and 
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g) comparing the labeling pattern obtained for the labeled peptides in the first and 
second mixtures, wherein an inverse labeling pattern obtained for a peptide in the 
second mixture compared with the labeling pattern obtained for the peptide in the 
first mixture is indicative of the differentially expressed protein from which the 
peptide originated. 

36. The method of claim 35, which farther comprises separating the labeled peptides from 
the first and second mixtures prior to step (f). 

37. The method of claim 36, wherem the step of separating the labeled peptides from the 
two rmxtures is carried out using a technique selected from the group consisting of size 
exclusion chromatography, ion exchange chromatography, adsorption chromatography, 
reverse phase chromatography, affinity chromatography, mmiunoprecipitation and 
combinations thereof. 

38. The method of claim 35, wherein detection of the labeled i?eptides is carried out by 
mass spectrometry, 

39. The method of clahn 35, which further comprises sequencing one of the peptides to 
identify the differentially expressed protein from which the peptide originated, 

40. The method of claim 39, wherein sequencing of the peptide is performed utilizing 
tandem mass spectrometry or PSD. 

41. The method of claim 35, which further con^arises subjecting the samples to at least 
one fractionation technique to reduce the complexity of proteins in the samples prior to 
step (a). 

42. The method of claim 35, which ftirther comprises subjecting the labeled peptides of 
the first and second mixtures to at least one fractionation technique to separate undesirable 
peptides from the first and second mixtures prior to step (e). 
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43. The method of claim 41 , wherein the fractionation technique is selected from the 
group consisting of ammonium sulfate precipitation, isoelectric focusing, size exclusion 
chromatography, ion exchange chromatography, adsorption chromatography, reverse phase 
liquid chromatography, affinity chromatography, ultrafiltration, immunoprecipitation and 
combinations thereof* 

44. The method of claim 35, wherein the samples are selected from the group consisting 
of cell homogenates, cell fractions, tissue homogenates, biological fluids, tears, feces, saliva 
and lavage fluids. 

45. The method of claim 35, wherein the differentially expressed protein is selected from 
the group consisting of cell surface proteins, membrane proteins, cytosolic proteins and 
organelle proteins. 

46. The method of claim 35, wherein the two samples differ in cell type, tissue type, 
physiological state, disease state, developmental stage, physiological state, environmental 
conditions, nutritional conditions, chemical stimuli or physical stimuli. 

47. A method for identifying a differentially expressed protein in two different samples 
containing a population of proteins comprising: 

a) providing two equal protein pools from each of a reference sanrple and an 
experimental san^le wherein one pool from each of the reference and 
experimental pools is produced by cultivation in a medium containing an 
isotopically heavy-labeled assimilable source to provide an isotopically heavy- 
labeled reference pool and an isotopically heavy-labeled experimental pool, and 
wherein the remaining reference and experimental pools are produced by 
cultivation in a medium containhig an isotopically light-labeled asshaailable source 
to provide an isotopically light-labeled reference pool and an isotopically light- 
labeled experimental pool; 

b) combining the isotopically light-labeled reference pool with the isotopically 
heavy-labeled experimental pool to provide a first protein mixture; 
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c) combining the isotopically heavy-labeled reference pool with the isotopically 
light-labeled experimental pool to provide a second protein mixture; 

d) detecting the labeled proteins from each of the two mixtures; and 

e) comparing the labeling pattern obtamed for the labeled proteins in the fkst and 
second mixtures, wherein an inverse labeling pattern of a protein in the second 
mixture compared with the labeling pattern of the protem in the first mixture is 
indicative of the differentially expressed protein in the two different samples, 

48. The method of claim 47, which further comprises enzymatically or chemically 
cleaving the labeled proteins in the first and second mixtures to provide peptide mixtures 
prior to step (d), 

49, The method of claim 47, wherein the assimilable source is selected from the group 
consisting of ammonium salts, glucose, water and anaino acids. 
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Figure 4 
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Figure 5 
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Figure 6 
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Figure 9 
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Figure 11 
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